A Fresh Look at Coding for 
g-ary Symmetric Channels 

Claudio Weidmann and Gottfried Lechner 



Abstract 

This paper studies coding schemes for the g-ary symmetric channel based on binary low-density parity-check (LDPC) codes 
that work for any alphabet size q — 2 m , m 6 N, thus complementing some recently proposed packet-based schemes requiring 
large q. First, theoretical optimality of a simple layered scheme is shown, then a practical coding scheme based on a simple 
modification of standard binary LDPC decoding is proposed. The decoder is derived once from first principles and once using a 
factor-graph representation of a front-end that maps g-ary symbols to groups of m bits connected to a binary code. The front-end 
can be processed with a complexity that is linear in m — log 2 g. An extrinsic information transfer chart analysis is carried out 
and used for code optimization. Finally, it is shown how the same decoder structure can also be applied to a larger class of g-ary 
channels. 

^ ' Index Terms 
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g-ary symmetric channel, low-density parity-check (LDPC) codes, decoder front-end. 

I. Introduction 

HE g-ary symmetric channel (y-SC) with error probability e takes a g-ary symbol at its input and outputs either the 
unchanged input symbol, with probability 1 — e, or one of the other q — 1 symbols, with probability e/(q — 1). It has 
attracted some attention recently as a more general channel model for packet-based error correction. For very large q, its 
appropriately normalized capacity approaches that of an erasure (packet loss) channel. In the following, we will only consider 
channel alphabets of size q = 2 m with m £ N. 

The capacity of the q-SC with error probability e is 

0\ '■ C i-sc = m - h(e) - e log 2 (2™ - 1) 

O 

' bits per channel use, where h(x) = —xlog 2 x — (1 — x) log 2 (l — x) is the binary entropy function. Asymptotically in m, the 
normalized capacity C g -sc / m thus approaches 1 — e, which is the capacity of the binary erasure channel (BEC) with erasure 
■ probability e. 

Recent work [l]-[5] has shown that it is possible to approach C q sc for large alphabet sizes q — 2 m , with symbols of 
hundreds to thousands of bits, and complexity O(logq) per code symbol. The focus of the present work is on smaller q, with 
symbols of tens of bits at most, although the presented coding techniques will work for any q = 2 m . 

The g-ary channel input and output symbols will be represented by binary vectors of length m. Hence a simplistic coding 
approach consists in decomposing the q-SC into to binary symmetric channels (BSCs) with crossover probability 
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£ bsc 2{q-\) 2(1-2-™)' (1) 

which have capacity Cbsc = 1 — h(eBSc) each. 

We briefly study the normalized capacity loss, A = C q sc/m — Cbsc, which results from (wrongly) assuming that the q-SC 
is composed of independent BSCs. For fixed to, we have lim e ^o A = 0; so using binary codes with independent decoders on 
the m-fold BSC decomposition might be good enough for small e (e.g. e < 10 -3 ). However, Figure Q] shows that the relative 
capacity loss mA/C q sc — 1 — toCbsc /C q sc increases close to linearly in e. For fixed e, we have linim^oo A = h(e/2) — e, 
which can be a substantial fraction of the normalized g-SC capacity (e.g., for e = 0.1, h(e/2) — e = 0.19). Figure [2] shows that 
already for small to, the q-SC capacity is substantially larger than what can be achieved with the BSC decomposition. Clearly, 
there is a need for coding schemes targeted at "large" e (say e > 10 _1 ), but moderate to (say 2 < to < 20), which are not 
that well handled by methods for large q. For example, when using a verification-based decoder and coding over blocks of n 
symbols, the symbols should have to = bits [1]. 
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Fig. 1. Relative capacity loss 1 



of marginal BSC vs. g-SC. 
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Fig. 2. Capacity of q-SC vs. marginal BSC (error probability e = 0.1). 



One example application, which also motivated this work, is Slepian-Wolf coding of q-axy sources with a discrete impulse- 
noise correlation model, using q-SC channel code syndromes as fixed-rate (block) source code. This can then be used as a 
building block for a Wyner-Ziv coding scheme, where q is the number of scalar quantizer levels, or for fixed-rate quantization 
of sparse signals with a Bernoulli-e prior on being nonzero [6]. Clearly, q will be only moderately large in such a scenario. 

The capacity loss of the binary decomposition is due to the fact that the correlation between errors on the binary sub-channels 
is not taken into account, i.e., since an error on one sub-channel of the q-SC implies a symbol error, it will be more likely 
for the other sub-channels to be in error as well. A better approach would be the use of non-binary low-density parity-check 
(LDPC) codes over GF(g) [7]. While that would take into account the dependency between bit errors within a symbol, the 
decoding complexity of the associated non-binary LDPC decoder is 0(q ■ \ogq) or at least O(q) when using sub-optimal 
algorithms [8]. 

Instead, this work focuses on a modified binary LDPC decoder of complexity 0(\ogq). Section ITT1 studies an ideal scheme 
using layers of different-rate binary codes, providing the key intuition that once a bit error is detected, the remaining bits of 
the symbol may be treated as erasures without loss in rate. Section [Til] then proposes a scheme using a single binary code 
and develops the new variable node decoding rules from first principles, by factorizing the posterior probabilities. Section HVl 
shows that the new decoding rule is equivalent to a front-end that maps g-ary symbols to groups of m bits and studies its 
factor graph representation. Section [V] provides the extrinsic information transfer chart characterization of this q-SC front-end, 
which is used in Section [VI] to design optimized LDPC codes. Finally, Section [VTIl shows that these decoding methods extend 
to a larger class of q-ary channels. 



II. Layered Coding Scheme 
We study the following layered coding scheme based on binary codes. Blocks of k symbols 



bit layers [uj, uf, . 



u i , . . . , u ] are split into m 
*], i = 1, . . . , m, and each layer is independently encoded with a code for a binary symmetric erasure 
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channel (BSEC) with erasure probability S{ and crossover probability e^, to be specified below. The channel input symbol will 
be denoted ] T , where x\ is the i-th bit of the j-th symbol, while the corresponding channel output is 

V J = [y{,yi,---, VLV^ respectively y{. 

The key idea is to decode the layers in a fixed order and to declare bit erasures at those symbol positions in which a bit 
error occurred in a previously decoded layer. This saves on the code redundancy needed in the later layers, since erasures can 
be corrected with less redundancy than bit errors. 

The decoder performs successive decoding of the m layers, starting from layer 1 . All errors corrected at layer i and below are 
forwarded to layer i + 1 as erasures, that is all bit error positions found in layers 1 up to i will be marked as erased in layer i + 
even though the channel provides a (possibly correct) binary output for those positions. Let ej be the probability that the channel 
output y is equal to the input x in bit positions 1 to i — 1 and differs in position i (i.e. [2/1,2/2, ■ • • , J/i-i] T = [ x i, %2, ■ ■ ■ , £i-i] T 
and yi =^ xi). A simple counting argument shows that there are 2 m ~ l such binary vectors y ^ x, out of a total 2 m — 1. The 
i-th binary sub-channel is thus characterized by 

cym—i 

(2) 



2 m - 1 ' 

* — 1 2 m ^rn — i+l 

Theorem 1 The layered scheme achieves q-SC capacity. 

Proof: We assume an ideal scheme, in which all layers operate at their respective BSEC capacities and correct all errors 
and erasures. The BSEC(<5i,£i) capacity is 



C BSE c = (l-Si)\l-h\—^-jj. (4) 

Hence the sum of the layer rates becomes 

m m , , v v 

Si - (I- <5 l )log 2 (l - Si) + e l \og 2 e i + (1 - <5 l+ i)log 2 (l - (6) 
^2 { - ( m - + £ t lo &2 + (1 -P) log 2 (l - p) (7) 



E 



= m + E { ~ ( TO ~ *) £i + ( m - e *} +plog 2 ?J -plog 2 (2 m - 1) + (1 -p) log 2 (l -p) (8) 

i=l 

- m - ft(p) - plog 2 (2 m - 1) = C,. sc , 

where (O follows from dU and the definition of the layered scheme, (O follows from Si + e; = (which holds up to i — m, 
when S m +i = e), (O follows from the evaluation of the telescoping sum and YllLi — Efcli( m — *) e *' anc l © follows from 
substituting (f2]i for e,. ■ 

The intuition behind the optimality of this (seemingly suboptimal) layered scheme is that once a bit error (and thus a symbol 
error) has been detected, all the following layers have bit error probability 1/2 in that position, since the g-SC assigns uniform 
probabilities over the possible symbol error values. Now the BSC(l/2) has zero capacity and so the concerned bits can be 
treated as erasures with no loss. 

Since the capacity gain relative to m — 1 becomes smaller with increasing m, as can be seen in Figure |2] an interesting 
variant of the layered scheme is to use \i < m BSEC layers as above, followed by a single "thicker" layer, which sends the 
remaining m — /j, bits (per symbol) over the same BSEC(<5 M +i, e^+i). In particular, this could even be beneficial in practical 
implementations, since combining layers leads to longer codewords and thus better codes, which might outweigh the theoretical 
capacity loss. The next section will show that it is actually possible to reap the benefits of a single large binary code, without 
layers, by using a decoder that exploits the dependencies among the bits in a symbol. 

III. Bit-Symmetric Coding Scheme 

The main disadvantage of the layered scheme is that in a practical implementation, each layer will need a different code 
that is tuned to the effective erasure and error probabilities resulting from the layers preceding it. This makes it impractical 
for hardware implementation, since the required silicon area would necessarily grow with the number of layers. 
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Ideally, a coding scheme for the g-SC should be symmetric in the bits composing a symbol, that is, no artificial hierarchy 
among bit layers should be introduced (notice that the order of the bit layers may be chosen arbitrarily). We propose to encode 
all bits composing the symbols with one "big" binary code, which needs to satisfy just slightly stricter constraints than an 
ordinary code for the BSC, while the decoder alone will exploit the knowledge about the underlying q-SC. The key concept 
that should carry over from the layered scheme is that the decoder is able to declare erasures at certain symbol positions and 
thus needs less error correction capability (for part of the bits in erased symbols). 

Our proposal for a practical symmetric q-SC coding scheme relies on a LDPC code with information block size K = mk bits 
and channel block size N = mn bits. We assume that the variable nodes (VNs) in the decoder receive independent extrinsic soft 
estimates of X\ (that is, bit i of code symbol/vector X 3 , for i = 1, . . . , m, j = 1, . . . , n) from the check nodes (CNs). These 
amount to estimates of P(Xf\Y^ = y^ ) or the corresponding log-likelihood ratio (LLR), L(X) — log(Pr(X = 0)/Pr(X = 
1)). (As usual, the notation y"' denotes the block consisting of all symbols/vectors except the j-th.) In particular, the extrinsic 
estimate of X\ is assumed to be independent of the other bits of the same symbol, so that we may write 

P(Xf\X(, . . . , Xl ^Y j- = y W) = P(X||YM = y W). 

In the standard case, these independence assumptions are justified by the fact that asymptotically in the block length, the 
neighborhood of a VN in the LDPC decoder computation graph becomes a tree [9, Chap. 3]. Unfortunately, the g-SC VN 
message computation rule has to depend on the other bits in the same symbol, in order to account for the bit error correlation, 
and thus will introduce cycles. However, this problem can be alleviated by imposing an additional constraint on the code, 
namely that the parity checks containing X\ do not involve any of the bits Xl-,. This is a necessary condition for the above 
intra-symbol independence assumption and may be achieved by using an appropriate edge interleaver in the LDPC construction. 
Then the cycles introduced by the VN message rule will grow asymptotically and are thus not expected to lead to problems 
in practice. 

As suggested above, the properties of the q-SC can be taken into account via a simple modification of the VN computation 
in the message-passing decoding algorithm for binary LDPC codes. We factor the a posteriori probability (APP) of symbol 
Xi as follows: 

P(X j \Y = y) = P(Y j = y j \X j )P(X j \Y^ = yW) 

m 

= P(Y j = y*\X*) J] P(X(\YW = yW), (9) 
i=i 

where the factorization in (O is made possible by the above independence assumption (the symbol = denotes equality up to 
a positive normalization constant). Using the definition of the q-SC, this becomes 

We define the extrinsic probability that = yi as 

m m. 

i' 1 £ /' v /; ;Y ' >•', \[,>: 

i=l 1=1 

where we introduced 

p\ = P(Xi = yl\YW =y \J]) 
for notational convenience. The normalization constant in ( TTOb thus becomes 

7* = (l_ e )/3»' + -l I (l_^). 

Then the bit APP may be obtained by the marginalization 

P(X{=4\Y = y)= ]T P(Xi =x j \Y = y), (11) 

xf i) e{o,i} m - 1 



which may be written as 



. (i-e)fl?„ + _i T (1-/3/ 
P{Xf =^|Y = y) = > K )f[l] q ~ lK fb 



Pi i _ , j 



q— l 7J 



(12) 
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where j$L Jp^ is the "intra-symbol" extrinsic probability that X 3 
express the a posteriori bit-level LLR as 



yi , using no information on bit Xf. Finally, we may 



L app (X|=^)=log 



log 



log 1 



q-qe-1 



1 ~Pi J 
pQj+L c ^{Xl=yl). 



(13) 



The usual L{X) is obtained from L(X = y) = log(Pr(X = y)/Pr(X = l - y)) via a sign flip, L{X) = (1 - 2y)L(X = y). 

The second term in ( fT3l corresponds to the extrinsic information from the CNs that is processed at the VNs in order to 
compute the bit APP in standard binary LDPC decoding. The difference lies in the first term in (|131 l, which corresponds to the 
channel LLR (which would be L c h(X) = \og(P(y\X — Q)/P(y\X = 1)) in the binary case). When the extrinsic information 
on the bits X?„ favors the hypothesis X 3 ^ t/ J , the product f$L will be small and therefore L c h in ( [T3l will be close to zero, 
which is equivalent to declaring a bit erasure. This shows that the symmetric LDPC scheme relies on "distributed" soft bit 
erasure estimates, while in the layered scheme the erasures are declared in a hard "top-down" fashion. 

Equation (fT3l l describes the modification of the VN computation that turns a message-passing binary LDPC decoder into 
one for the g-ary symmetric channel. The outgoing VN messages are computed as usual by subtracting the incoming edge 
message from L app (X^); also the CN messages are the same as in the binary case. For practical implementation purposes, 
dl3T > should probably be modified (approximated) in order to avoid switching back and forth between probabilities and LLRs 
when computing . A final detail is the specification of the initial channel LLR in 113) . which is needed to start the 
decoder iterations. By inserting the memoryless worst-case estimate /3L = 2 _m+1 into ( TT3l . we obtain 



4 0) = log 



2(1 



(14) 



which is exactly the channel LLR for the marginal BSC with crossover probability esse given in JT). 

Notice that the decoder iterations are exclusively between VN dot and CN computations, like in the binary case. However, 
computing ( fT3l at the VNs requires the extrinsic information (the CN messages) for all bits within a symbol; this might be 
considered an additional level of message exchanges (specifically, plain copying of messages), but it does not involve iterations 
of any kind. The complexity increase compared to binary LDPC decoding is on the order of at most m operations per variable 
node, depending on the scheduling. In fact, the marginalization (fTTT) is reminiscent of a combined detector and VN decoder 
for binary LDPC codes that are directly mapped to larger signal constellations [10]. Thanks to the symmetry of the q-SC, here 
it is not necessary to actually sum over all q symbol values. Other similar work includes iterative demapping and decoding 
[11], [12], which however involves proper iterations between the demapper and the LDPC decoder, being treated as separate 
functional blocks. The next section will take that point of view. 



IV. Factor Graph Representation of a q-SC Front-end 

This section presents a different view on the bit-symmetric coding scheme by considering a q-SC front-end for an LDPC 
decoder, similar to approaches for iterative demapping and decoding [11], [12]. The front-end takes into account the correlation 
between the errors on the m bit layers. Its factor graph representation allows to formulate a message passing algorithm that 
computes essentially the same quantities as the "direct" algorithm in Sec. [TTTl but displays more clearly the opportunity for 
further complexity reduction by appropriate message scheduling. 

As before, let x denote the vector of q-ary channel input symbols x 3 (j = 1, . . . , n) and let x\ denote bit i of symbol j. In 
the same way, the output of the channel is represented by y, y 3 and yf, and the errors are denoted by e, e 3 and e\, respectively. 
We assume w.l.o.g. that ej = x\ ®y\, where © denotes addition in GF(2), and e J = x 3 © y\ by extension. 

Let Xc( x ) be the characteristic function of the code, which evaluates to one if x is a codeword and to zero otherwise. 
Furthermore assuming that the transmitted codewords are equally likely, the a posteriori probability satisfies the proportionality 
relation 

/x|v(x|y) = xc(x)/ Y |x(y|x). 
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Fig. 3. Factor graph representation of the g-SC front-end for symbol x 3 (the symbol index j is omitted). 

The q-ary channel is assumed to be memoryless (on the symbol level), leading to a factorization of /y|x as 

n 

/x| Y (x|y) = xc(x) J] /v|aW) 

3=1 

=xc(x)f[(^/K, E |x(j/ i ,^i^)] 

3=1 I e3 J 

n 

= Xc(x) n^-B|x(y J ,a; J ®y J \x J ), 

3=1 

where in the last step we used the fact that x^ , & 1 and y J are related in a deterministic way. 
Given an error symbol e J , the bit-layers are independent of each other, leading to 

/x| Y (x|y) = xc(x) fl ^fE(e j )f Y \x,E(y j \ xj > ej )} 

n ( rn \ 

3=1 I t=l J 

where = f Y i {x i >B f. 

This factorization is shown for one symbol in Fig. [3] where the edges on the right side are connected to the check nodes 
of the LDPC code, i.e. the factorization of the characteristic function xc(x). In the following, we will denote a message sent 
from a variable node a; to a function node / and vice versa as fj,^f(x) and Hf-, x (x), respectively [13]. 



A. Message Passing Rules 

We apply the sum-product algorithm to compute the marginals fx^A-yi^H W) f° r a ^ symbols j and all bits i, i.e. to perform 
a posteriori decoding for every bit of the transmitted codeword. In the following we will describe the operations for a single 
q-ary symbol and therefore omit the symbol index j for convenience. 

First, we define the local functions and derive the message passing rules for the function nodes / c h and Since the variable 
nodes have degree two, they just forward the incoming messages, e.g. fi ei -*f A (ci) = >ei (ei) and /i ei _ > ^ B (ei) = 
A*/d,-* ei (ei). 

1) Bit Layer Channels: In the binary sub-channels, Xi, yi and ej have to sum up to zero in GF(2), therefore the local 
functions / c h are given by 

feh(Vi\xi,ei) = [yi = x t © ei], 
where [P] evaluates to 1 if the expression P is true and to otherwise. 
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Having denned the local function, we can derive the message passing algorithm by applying the rules of the sum-product 
algorithm. Given a message ^ Xi ^} ch {xi) and a received bit j/j, the channel function node computes 



M/oh^e 4 (ei) = fch(yi\xi,ei) ■ fJ, Xi ^(xi) 



where X)~{a} denotes the marginalization over all variables except a. In the same way, the message sent back to variable node 
Xi is computed as 



»U->Xi(Xi) = I fch{yi\xi,ei) -Mei-ZoCei) j 



= lJ-e i ^f A {x i ®y i ). 

2) Error Patterns: The function node Je represents the probability that a certain binary error pattern occurs. In the g-ary 
symmetric channel, every error pattern has the same probability ^-y, except the all-zero pattern that has probability 1 — e, thus 



fsiei, ■ ■ ■ ,e m ) = 

The derivation of the outgoing messages of Je leads to 



1 - e, if ei = . . . = e m = 
, otherwise. 



~{e,} V »V» / 

(1 - p) - y9 w + (1 - Afl) ife* = 
TpT if * = 1, 



where /^j is defined as 



B. Simplification of the Messages 

Since all involved variables are binary, we can represent the messages by scalar quantities. For the messages from and to 
the variable nodes x\ of the LDPC code, we use log-likelihood ratios of the corresponding messages 

.M*i-/*(0) 



L a .t = log 
LchA = log 



Mxi-/ch(l)' 

(°) 



M/ch^X, (1) ' 

The messages from the function nodes / C h to the node Je are defined as the probability of no error 

Pi = M/oh-e^O) = /z ej _>/ E (0), 
and the messages from the function node Je are defined as the error probability 

M/ E ^e 4 (0) +M/ E - ei (l)' 
Using these definitions, we can reformulate the computation rules at the function nodes as 

e i^i.(i_2 yi ) 

P* = "EH ~^T' 

e 2' + e 2' 

and 

e 

, _ 9-1 



e 



2e + /3 w (g-«j-l) 



(16) 



x 




Fig. 4. EXIT function for the g-SC front-end with BEC (analytical) and Gaussian (simulated) a priori messages for e = 0.25, m = 4 (lower curves) and 
m = 8 (upper curves). 



For the initialization, we set all a-priori L-values L a j to zero, leading to /J™ = 2 ' m 1 - ) = 2/q. Inserting this in (fl6b leads 

to 

£l ~ % — T) ~ £BSC - 

Finally, the messages sent from the function nodes / c h to the variable nodes Xi are computed as 

L ch ,i = (1 - 2j/i) • log - — — . 

C. Complexity 

Clearly, the processing complexity of the front-end is dominated by ( fl5l l and is thus O(logg) per symbol, as in verification 
decoding [1]. This means that the overall decoding complexity scales linearly in the number of transmitted bits, independently 
of the symbol alphabet size q. Complexity may be further reduced by a constant factor (i.e. not its order) if the front-end 
messages are not recomputed on every LDPC decoder iteration; this is often sufficient in practice. 

V. EXIT Analysis of the g-SC Front-end 

In this section, we analyze the g-SC front-end using extrinsic information transfer (EXIT) charts [14], which will later also 
be used for the design (optimization) of LDPC codes. 

Let the a priori and extrinsic messages for the front-end denote the messages between the bit layer sub-channels / c h and the 
bit nodes Xi of the LDPC code, that is, jtx Xi — >/ h( a; i) an d ^fch^xiixi), respectively. For the EXIT chart analysis, the front-end 
is characterized by the transfer function I e {I a ), where I a and I e denote the a priori and the extrinsic mutual information, 
respectively, between the messages and the corresponding bits. 

First, we derive the EXIT function of the front-end for the case when the a priori messages are modeled as coming from 
a binary erasure channel (BEC). In this special case, the EXIT functions have important properties (e.g. the area-property), 
which were shown in [14]. 

In the case of a BEC, the variables Xi are either known to be error-free or erased, thus simplifying the computation of (3u\ 
defined in (TT3T > as follows. The product in (fT5T l has m — 1 factors, taking on the value 1/2 if the bit Xi is erased, if the bit 
Xi is not in agreement with the received bit j/j and 1 otherwise. If at least one message is not in agreement with the received 
bit, a symbol error occurred and /3u\ = 0. Otherwise, [3ui = 2~*, where t denotes the number of erased messages. 

Since (3ui — corresponds to ej = 1/2 through (floT i. this case is equivalently described by an erasure. Therefore, the 
extrinsic messages can be modeled as being transmitted over a binary symmetric erasure channel (BSEC), with parameters 
e m _t and S m -t given by (ffji and ©, respectively, since t erasures correspond to the situation in layer i = m — t in the layered 
scheme of Section [11] Its capacity is thus 

^(1-^0(1-^^)). (17) 
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In order to compute the extrinsic information I e (I a ) at the output of the front-end as 

m—l 

i e {i a ) = iMQ, (18) 

t=0 

we have to compute the probability At(J Q ) that t messages are erased given a priori mutual information I a . Let A a = 1 — I a 
denote the probability that an a priori message is erased. The probability that t = 0, . . . , m — 1 out of m — 1 messages are 
erased is computed as 

m — 1 
t 

The EXIT function for BEC a priori messages can therefore be computed using (O, (0), ( fTTl ) and $1% in ( fT~8b - 

According to the area theorem of EXIT charts [14], the capacity of each bit layer is given by the area below the EXIT 
function. Therefore, the capacity of the overall channel is given by 

rl m— 1 



l-J^C" 1 -*. (19) 



/•l lib — j . 

Go, = m / Y] I t \ t (I a )dI a 
Jo t=o 

m — 1 „i 

^ It Xt(Ia)dI a 

Jo 



t=0 

ro-1 / i\ /■! 

-nVii" 1 ; / (i-iafir'-'dia 



^ \ t 

t=0 



m—l 



(to-1)! t\{m-l-t)\ 



= m > L , 

= Y It = c «- sc > (20) 

t=0 

where we used the fact that the integral corresponds to the definition of the beta function. The last sum in d20b is indeed equal 
to the capacity of the q-SC, as was already shown in the proof of Theorem 1 . 

Without iterative processing, the maximum achievable rate is given by / e (0). In that case, all a priori messages are erased 
and one has 

Z ro _! = l-h ( Cl ) = 1 - h (^py) J = Cbsc, 

which corresponds to the simplistic coding approach where the q-SC is decomposed into to BSCs. 

Figure 2] shows an EXIT chart for BEC a priori messages. The value at I a = denotes the maximum achievable rate without 
iterative processing and decoding and the area below the EXIT function corresponds to the capacity of the q-SC. When using 
this front-end with an LDPC code, the a priori messages can not be modeled as coming from a BEC channel. For code design, 
we will make the approximation that the messages are Gaussian distributed, which is a common assumption when using EXIT 
charts. The simulated EXIT function using Gaussian a priori messages is also shown in Figure [4] 



VI. LDPC Design and Construction 

After obtaining the EXIT function of the q-SC front-end, we can design an LDPC code. Code design is an optimization 
problem that selects the degree distributions of the code in order to maximize the rate, under the constraint that the EXIT 
function of the overall LDPC code does not intersect the EXIT function of the q-SC front-end. Joint optimization of the 
variable and check node degree distributions is a nonlinear problem, but it was shown in [12] that the optimization of the 
check node degree distribution given a fixed variable node degree distribution is a linear programming problem, which can be 
solved efficiently. 

For the optimization in [12], it was assumed that the decoder is allowed sufficient iterations between variable and check nodes 
before performing an outer iteration (i.e. processing the front-end). While this assumption is necessary for the optimization 
of the code, it can be dropped when running the actual decoder if the graph is free of (short) cycles, since the choice of 
scheduling in a cycle-free graph has no influence on the final result. 

Since we want the variable nodes to have the same convergence behavior, we use variable-regular codes and fix the variable 
node degree to d v = 3. Furthermore, we set the maximum check node degree to d c , max = 50 in order to obtain results for 
which codes with practical lengths can be constructed. The resulting optimized check node degree distribution [9, Chap. 3] has 
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Fig. 5. Rate of optimized codes versus normalized capacity of the q-SC for m = 4 and m = 8. 



Optimized code 
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Fig. 6. Bit error rate simulation for rate 1/2 LDPC decoding (m = 4, k = 1500 symbols). 

only few non-zero values in general, e.g. the optimized distribution for m = 4 and rate R = 1/2 is p4 = 0.1087, p5 = 0.6753, 
P49 = 0.0299 and p 50 = 0.1861. 

Fig. [5] shows the normalized capacity of the q-SC and the obtained optimized code rates versus the error probability e of 
the g-SC for m = 4 and m = 8. Using this code optimization, we are able to design codes that perform close to capacity over 
a wide range of error probabilities for moderate q. 

For the actual code construction, we used a modified version of the PEG algorithm [15], [16], which allows us to construct 
variable-regular codes with a given check node degree distribution. In order to avoid (short) cycles in the overall factor graph, 
one has to ensure that the bits of a given q-aiy symbol do not participate in the same parity check. This can be achieved by 
using an appropriate interleaver between the q-SC front-end and the LDPC decoder. 

To verify our derivations, we performed bit error rate simulations for an optimized binary LDPC code of rate R — 1/2 
and length N = 12000, shown in Fig. [6] This code is optimized for a q-SC with m = 4, thus N = 12000 bits correspond 
to k = 1500 g-ary symbols. It has a decoding threshold of e = 0.26 (the Shannon limit for R = 1/2 is at e = 0.29). As a 
comparison we also show the bit error rate for a regular LDPC code with d v = 3 and d c — 6. 

VII. Generalization to Conditionally Independent Binary Sub-channels 

The decoder in Section [Til] is seen to easily extend to g-ary channels with modulo-additive noise, where the m bits of the 
noise (error pattern) are independent if an error occurs, i.e. if at least one bit is nonzero. This assumption leads to define the 
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g-ary symmetric channel with conditionally independent binary sub-channels (g-SC*), with transition probabilities 
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and © denotes GF(2) addition (this definition encompasses all channels satisfying the above conditional independence assump- 
tion, from which it may be derived axiomatically). The defining parameters are the symbol error probability e and the m 
conditional probabilities eju, which are the bit error probabilities conditioned on the event that at least one of the other m — 1 
bits is in error. The marginal bit error probabilities will be = aeju. Since the q-SC* is strongly symmetric, the uniform 
input distribution achieves its capacity 
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When e is such that a = 1, the g-SC* reduces to m independent BSCs, while for e^u - , 
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The unnormalized bit APP (compare ST% ) is obtained as 
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leading to the LLR expression (l2Tb . The quantities p?, ft and 0L are as defined in Section [TTTJ Like in the special case of the 

q-SC, for (3L — > the channel LLR L c h tends to a constant, which is equal to the LLR of the binary sub-channel conditioned 
on the occurrence of an error on one or more of the other sub-channels. The initial value of can again be obtained by 
assuming a worst-case of p 3 k = |, 0L = 2~ m+1 in (fJTJ, yielding 
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which is the channel LLR of the marginal BSC. 
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